A Simplified Method for Handwritten Character Recognition from Document Image

نویسندگان

  • Mohammad Imrul Jubair
  • Prianka Banik
چکیده

This paper presents a simple and effective technique for converting handwritten textual character from paper document into machine readable form. The proposed method takes the scanned image of the handwritten character from paper document as input and shows the recognized character as its output. Using this method, the object in the converted binary image is segmented and is resized in a global size. After that, morphological thinning operation is applied on that resized object. The image with thinned object is then partitioned into several equal sizes of small cells. A value from each cell is estimated by calculating the proportion of the number of 1 intensity pixels and the number of 0 intensity pixels in the corresponding cell. All of these estimated values are then stored in a one dimensional array. Every element in that array is considered as a single feature value or an attribute for the corresponding image. The k-nearest neighbor classifier is used to classify the handwritten character into the recognized classes of characters. Feature values are estimated from training example images and the classifier is trained using the attributes. After training attribute values for sample image are extracted and passed as inputs in the k-nearest neighbor classifier and the sample image object is grouped using the training dataset into the desired character classes. The proposed technique takes less time to compute, has less complexity and shows desired performance in matching the handwritten characters with the machine readable form and in recognizing them.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Neural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten

Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...

متن کامل

Connected Component Based Word Spotting on Persian Handwritten image documents

Word spotting is to make searchable unindexed image documents by locating word/words in a doc-ument image, given a query word. This problem is challenging, mainly due to the large numberof word classes with very small inter-class and substantial intra-class distances. In this paper, asegmentation-based word spotting method is presented for multi-writer Persian handwritten doc-...

متن کامل

Skew Detection and Correction for Gujarati Printed and Handwritten Character using Linear Regression

In this paper, we have proposed approach for skew detection and correction of handwritten and printed Gujarati document using Linear Regression method/technique. Skew detection and correction is important for any recognition system as it directly affects the recognition process of characters/documents. The proposed method work involves linear regression formula for detecting angle of rotation a...

متن کامل

Different Edge Detection Algorithms Comparison and Analysis on Handwritten Chinese Character Recognition

The acquired image will has different levels of noise pollution and image distortion in Handwritten Chinese character recognition document image processing. For these situations the accurate and fast edge detection method is an important prerequisite for the recognition results. The widely used edge detection algorithms such as: first derivative-based edge detection method, second derivative ed...

متن کامل

Off-line Arabic Handwritten Recognition Using a Novel Hybrid HMM-DNN Model

In order to facilitate the entry of data into the computer and its digitalization, automatic recognition of printed texts and manuscripts is one of the considerable aid to many applications. Research on automatic document recognition started decades ago with the recognition of isolated digits and letters, and today, due to advancements in machine learning methods, efforts are being made to iden...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012